PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG005074t1
Common NameTCM_005074
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 336aa    MW: 37779.8 Da    PI: 6.0864
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG005074t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox56.54.7e-183084256
                      T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox  2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                      +++++f++eq++ Le +Fe+++ +  +++ + A++lgL+ rqV +WFqN+Ra++k
  Thecc1EG005074t1 30 KNKRRFSDEQIKSLELMFESESRLEPRKKVQVARELGLQPRQVAIWFQNKRARWK 84
                      67889*************************************************9 PP

2HD-ZIP_I/II119.12.3e-3831122293
       HD-ZIP_I/II   2 kkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelke 93 
                       +krr+s+eq+k+LE +Fe+e++Lep++Kv++areLglqprqva+WFqn+RAR+k+kqlEkdy  L+++y++l+++ e+L+ke+++L  +l++
  Thecc1EG005074t1  31 NKRRFSDEQIKSLELMFESESRLEPRKKVQVARELGLQPRQVAIWFQNKRARWKSKQLEKDYGILQANYNNLASKFESLKKEKQALVIQLQK 122
                       79************************************************************************************999986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.17E-181288IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.603.0E-181685IPR009057Homeodomain-like
PROSITE profilePS5007117.8312686IPR001356Homeobox domain
SMARTSM003891.2E-152890IPR001356Homeobox domain
PfamPF000462.7E-153084IPR001356Homeobox domain
CDDcd000861.20E-143387No hitNo description
PRINTSPR000314.8E-55766IPR000047Helix-turn-helix motif
PROSITE patternPS0002706184IPR017970Homeobox, conserved site
PRINTSPR000314.8E-56682IPR000047Helix-turn-helix motif
PfamPF021831.3E-1586127IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009414Biological Processresponse to water deprivation
GO:0009738Biological Processabscisic acid-activated signaling pathway
GO:0045893Biological Processpositive regulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0016021Cellular Componentintegral component of membrane
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 336 aa     Download sequence    Send to blast
MLDGEEYTQE MPEPFASMNE VTTTKKKKNK NKRRFSDEQI KSLELMFESE SRLEPRKKVQ  60
VARELGLQPR QVAIWFQNKR ARWKSKQLEK DYGILQANYN NLASKFESLK KEKQALVIQL  120
QKLNDLLKKP KEEGQCCGQV TTTNSIDGDS EKGEAVKSDS EGQLSLSMER SEHALGVLSD  180
DDSVIKTDYF GLEEEPSLIN MVEPADGSLT SPEDWRSFDS VGLFDQSGSG YQCFPGVYHF  240
RPGSRKLEFT VYSEIMGNVC VGMGTERPRK VEQGVHSNMT YGHSARTLKD GIQIPTLQNP  300
IFFLFSLSPC CFSGGVLLFA QLFHPARDSM GLVGQ*
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00320DAPTransfer from AT2G46680Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007051465.10.0Homeobox 7
SwissprotP468978e-74ATHB7_ARATH; Homeobox-leucine zipper protein ATHB-7
TrEMBLA0A061DTA70.0A0A061DTA7_THECC; Homeobox 7
STRINGPOPTR_0014s09860.11e-112(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM24982874
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G46680.17e-72homeobox 7
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]